Building part compositions for hierarchical object recognition

نویسنده

  • Greg Nicholas
چکیده

The visual signature of objects in our world follow a particular blueprint: every object is composed of an arrangement of smaller visual "parts". These parts can be thought of on many different scales. An image of an car can be considered to be pattern of wheel, window, door, and various other compositional parts, or on a much more granular scale as a massive pattern of interconnected line segment parts. These levels of granularity lend themselves well to a hierarchical organization of visual blueprints: the car can be decomposed into doors and other similarly-scaled parts, each door can be decomposed into a window, a handle, and other features, and this reduction in scale can continue until a suitable base level of elemental parts is reached. In their 2007 paper, "Towards Scalable Representations of Object Categories: Learning a Hierarchy of Parts" [3], Fidler and Leonardis described an algorithm to implement such a categorization scheme with minimal human intervention. Given a large corpus of images, they showed their algorithm organizing subparts into patterns completely automatically from the bottom elemental layer to parts three layers higher. At this level, parts became too specialized to learn with a generic image corpus, so further learning was done on manually separated sets of images grouped by categories (e.g. faces, mugs, etc.). According to the authors, this was the only source of significant human intervention in the process. This work has attempted to recreate their algorithm from scratch. Unfortunately, this task was not successfully completed. Although the entire framework of the algorithm was written, automating the core pattern processing part of it proved to be a very nuanced and difficult task to complete. This paper will discuss the nature of this difficulty and describe the various successes and setbacks in developing a principled way to process the patterns in the data automatically.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Urban Vegetation Recognition Based on the Decision Level Fusion of Hyperspectral and Lidar Data

Introduction: Information about vegetation cover and their health has always been interesting to ecologists due to its importance in terms of habitat, energy production and other important characteristics of plants on the earth planet. Nowadays, developments in remote sensing technologies caused more remotely sensed data accessible to researchers. The combination of these data improves the obje...

متن کامل

Learning Hierarchical Representations of Object Categories for Robot Vision

This paper presents our recently developed approach to constructing a hierarchical representation of visual input that aims to enable recognition and detection of a large number of object categories. Inspired by the principles of efficient indexing, robust matching, and ideas of compositionality, our approach learns a hierarchy of spatially flexible compositions, i.e. parts, in an unsupervised,...

متن کامل

Hierarchical Spectro-Temporal Models for Speech Recognition

We seek to explore computational approaches for audition that are inspired by computational visual neuroscience. In particular, we seek to leverage recent progress over the past few years in building a biologically-faithful hierarchical, feed-forward system for visual object recognition [13,14]. The system, which was designed to closely match the currently known feed-forward path in the ventral...

متن کامل

Learning Hierarchical Sparse Representations using Iterative Dictionary Learning and Dimension Reduction

This paper introduces an elemental building block which combines Dictionary Learning and Dimension Reduction (DRDL). We show how this foundational element can be used to iteratively construct a Hierarchical Sparse Representation (HSR) of a sensory stream. We compare our approach to existing models showing the generality of our simple prescription. We then perform preliminary experiments using t...

متن کامل

Efficient Clustering and Matching for Object Class Recognition

In this paper we address the problem of building object class representations based on local features and fast matching in a large database. We propose an efficient algorithm for hierarchical agglomerative clustering. We examine different agglomerative and partitional clustering strategies and compare the quality of obtained clusters. Our combination of partitional-agglomerative clustering give...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009